Handling Structural Ambiguity in a Knowledge-Based Information Retrieval System

نویسندگان

Bas van Bakel

Erik Oltmans

چکیده

This paper presents a strategy to handle syntactic ambiguity in a theoretically motivated fashion following general linguistic principles. This strategy, which is called underspecification, was implemented in a Natural Language Engine (NLE) for automatic information extraction, called ELSA (an acronym for English Language Semantic Analyser), which was developed at the Department of Language & Speech of the University of Nijmegen. The crucial idea of the strategy is that, in case of ambiguity, the NLE should know what option to choose and when to choose it. Until that moment the analysis remains underspecified, i.e. only one derivation is produced. At present time, the NLE in question is adapted to serve as the linguistic module of a knowledge-based Information Retrieval System, called Condorcet, being developed at theUniversity of Twente, for documents on the fields ofmechanical properties of engineering ceramics as a subfield of engineering, and epilepsy as a subfield of medicine. In this paper we will show how a theory-driven NLE will make a substantial contribution to (semi)automatic information retrieval, making use of the AGFL system. The authors are greatly indebted to Nicolaas J.I. Mars and Paul E. van der Vet, the initiators of the Condorcet project, for their substantial contribution to this article. Chapter

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature

Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...

متن کامل

Behavioral Considerations in Developing Web Information Systems: User-centered Design Agenda

The current paper explores designing a web information retrieval system regarding the searching behavior of users in real and everyday life. Designing an information system that is closely linked to human behavior is equally important for providers and the end users. From an Information Science point of view, four approaches in designing information retrieval systems were identified as system-...

متن کامل

Knowledge Sources for Textual CBR Applications

Textual CBR applications address issues that have traditionally been dealt with in the Information Retrieval community, namely the handling of textual documents. As CBR is a knowledge-based technique, the question arises where items of knowledge may come from and how they might contribute to the implementation of a Textual CBR system. In this paper, we will show how various pieces of knowledge ...

متن کامل

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل